SQuaScheD -- Additional Material

SQuaScheD - Unsupervised Schema Discovery for Heterogeneous Data

This web site provides additional material for the article "Here is the Data. Where is its Schema?", submitted to the 24th International WWW Conference 2015 as submission #312.

Below are the detailed hierarchies, the ground-truth class distributions, and the MDL evolution for the hierarchies discovered by SQuaScheD on all datasets mentioned in the paper.

Detailed SQuaScheD Discovered Hierarchies

The interactive visualizations below show, for each dataset, the most representative attributes and entities of each class in the hierarchy discovered by SQuaScheD:

Ground Truth Class Distribution

Distribution of the bottom-most ground-truth class in the discovered class hierarchy for all datasets.

Each dataset below has two panels, Ground Truth and SQuaScheD:

ActivityEducationalInstitution

ArchitecturalStructure

Event

Event_NaturalPlace_WrittenWork

Infrastructure

RouteOfTransportation

Species

Tunnel

MDL Evolution in SQuaScheD

The figures below show how the MDL and the class precision, recall, and F2 evolve over the steps of the SQuaScheD process for all datasets.

ActivityEducationalInstitution

ArchitecturalStructure

Event_NaturalPlace_WrittenWork

Event

Infrastructure

RouteOfTransportation

Species

Tunnel
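
For readers who want to reproduce the class-level scores plotted in the MDL evolution figures, here is a minimal sketch. It assumes that F2 denotes the standard F-beta score with beta = 2 (which weights recall more heavily than precision), and that class precision and recall are obtained by comparing the entity set of a discovered class with the entity set of a ground-truth class. The paper may define these quantities differently. The function names `f_beta` and `class_scores` are hypothetical and are not taken from SQuaScheD.

```python
def f_beta(precision: float, recall: float, beta: float = 2.0) -> float:
    """Standard F-beta score; beta > 1 favors recall over precision."""
    if precision == 0.0 and recall == 0.0:
        return 0.0  # avoid division by zero when there is no overlap
    b2 = beta * beta
    return (1 + b2) * precision * recall / (b2 * precision + recall)


def class_scores(discovered: set, truth: set, beta: float = 2.0):
    """Precision, recall, and F-beta of one discovered class
    against one ground-truth class (assumed set-overlap definition)."""
    overlap = len(discovered & truth)
    precision = overlap / len(discovered) if discovered else 0.0
    recall = overlap / len(truth) if truth else 0.0
    return precision, recall, f_beta(precision, recall, beta)


# Toy example with made-up entity identifiers.
p, r, f2 = class_scores({"a", "b", "c", "d"}, {"a", "b", "e"})
print(p, r, f2)
```

In this toy example, two of the four discovered entities belong to the ground-truth class, and two of the three ground-truth entities are recovered. The resulting F2 lies closer to the recall than to the precision.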